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ABSTRACT 

This  report  provides  a  technical  overview  of  research  supported  under  an  ASSERT  (Aug¬ 
mentation  Awards  for  Science  and  Engineering  Research  Training)  grant  administered  by  the 
U.S.  Air  Force  Office  of  Scientific  Research.  The  grant  was  tied  to  a  parent  grant  AFOSR-90- 
0226  and  its  follow-on  F49620-96- 1-0059,  both  of  which  bear  the  title  The  Nonlinear  Control 
Theory  of  Complex  Mechanical  Systems.  The  period  of  performance  of  the  subject  ASSERT 
grant  was  September,  1994,  through  August  31,  1997.  Funds  from  this  grant  were  used  to  sup¬ 
port  the  PhD  thesis  research  of  Geoffrey  Howell  as  stipulated  in  the  original  proposal  submitted 
in  the  Fall  of  1993.  Mr.  Howell  is  expected  to  finish  his  dissertation  and  all  degree  requirements 
in  1998.  The  research  described  in  this  report  (and  which  will  be  reported  in  greater  detail  in 
Geoff  Howell's  dissertation)  is  concerned  with  two  interrelated  themes  in  the  control  of  super- 
articulated  (or  underactuated)  mechanical  systems:  (1)  how  to  control  the  degrees  of  freedom 
which  are  directly  actuated  without  eliciting  undesired  behavior  in  the  unactuated  degrees  of 
freedom,  and  (2)  how  to  prescribe  motions  of  the  directly  actuated  degrees  of  freedom  to 
achieve  motion  objectives  for  the  degrees  of  freedom  which  are  not  directly  controlled. 


Contents 


•  t 


1  Introduction  and  Report  Summary  3 


2  Control  of  Single  Degree  of  Freedom  Hamiltonian  Systems  with  Impulsive 
Inputs  5 


3  A  Simple  Actuated  Walking  Mechanism  Controlled  by  the  Dynsunics  of 
an  Appended  Mass  20 


1 


1  Introduction  and  Report  Summary 


The  research  discussed  below  has  been  aimed  at  a  number  of  the  mathematical  and  engineer¬ 
ing  problems  arising  in  planning  and  controlling  the  motions  of  complex  vehicles  containing 
elastic,  fluid,  and  other  types  of  components  which  may  store  energy.  This  work  is  closely 
tied  to  a  larger  body  of  research  of  the  P.I.  (Baillieul)  on  the  control  theory  of  complex 
mechanical  systems.  Since  the  beginning  of  the  ASSERT  project,  the  scope  of  the  work  has 
been  broadened  somewhat  to  study  the  use  of  various  energy-storing  appendages  to  provide 
primary  actuation  in  simple  bipedal  walking  machines.  The  funds  provided  by  the  grant 
have  supported  the  PhD  studies  of  Geoffrey  Howell.  Mr.  Howell’s  research  has  progressed 
to  the  point  that  the  award  of  his  (PhD)  degree  is  expected  in  1998.  His  thesis  will  docu¬ 
ment  both  theoretical  and  experimental  work  on  motion  planning  and  control  of  autonomous 
mobile  mechanisms.  Technical  details  of  some  of  his  work  are  reported  here.  The  broader 
technical  issues  associated  with  our  ongoing  work  on  the  control  of  mechanical  systems  have 
previously  been  reported  in: 


1.  Final  Report  of  AFOSR-90-0226  The  Nonlinear  Control  Theory  of  Complex  Mechanical 
Systems  by  J.  Baillieul,  submitted  on  April  29,  1996. 

2.  Annual  Progress  Report  of  F49620-96- 1-0059  The  Nonlinear  Control  Theory  of  Com¬ 
plex  Mechanical  Systems  by  J.  Baillieul,  submitted  on  October  15,  1997. 


We  refer  to  these  reports  for  further  details,  and  we  note  that  the  more  recent  of  the  two 
reports  cites  fifty  one  (51)  archival  publications  which  collectively  document  the  research 
supported  under  both  the  ASSERT  and  parent  grants. 

The  principal  topics  of  the  present  report  are: 


1.  Control  of  Single  Degree  of  Freedom  Hamiltonian  Systems  with  Impulsive  Inputs 
(Chapter  2  below),  and 

2.  Analysis  of  the  Pendulum-driven  Rimless  Wheel  (Chapter  3  below). 


An  edited  version  of  the  paper  appearing  as  Chapter  2  appeared  in  the  Proceedings  of  the 
1996  IEEE  Conference  on  Decision  and  Control. 

The  unifying  theme  of  the  research  herein  described  is  the  multiobjective  control  of  mechan¬ 
ical  systems.  Howell’s  thesis  will  be  concerned  in  part  with  controlled  motions  of  simple 
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vehicles  having  dynamic  loads  (elastic  bodies,  fluid  filled  tanks,  etc.)  which  can  store  energy. 
The  objective  in  these  control  problems  is  to  execute  motions  between  prescribed  points 
while  managing  the  dynamics  of  the  loads,  which  are  not  directly  controlled  and  which  re¬ 
spond  to  the  motions  of  the  vehicles.  For  instance,  in  moving  a  vehicle  between  prescribed 
points,  we  shall  typically  want  the  internal  energy  to  be  zero  at  the  beginning  and  ending 
point.  Other  energy  management  objectives  can  also  be  considered. 


The  next  two  sections  of  the  report  treat  some  specific  model  systems  in  which  we  use  directly 
controlled  degrees  of  freedom  to  accomplish  specified  motion  objectives  for  the  degrees  of 
freedom  in  the  system  which  are  not  directly  controlled.  The  swing-up  and  stabilization  of 
a  pendulum  on  a  cart  using  impulsive  inputs  illustrates  a  novel  approach  to  control  which  is 
relatively  straightforward  to  implement  (as  reported)  while  at  the  same  time  being  applicable 
to  a  wider  class  of  nonlinear  mechanical  systems  (work  in  progress).  This  research  is  part 
of  a  continuing  effort  to  understand  how  the  geometry  of  the  wave  form  of  an  input,  and  in 
particular  its  frequency  content,  affects  the  dynamics  of  controlled  nonlinear  systems.  Our 
goal  is  to  understand  how  actuator  inputs  should  be  designed  so  as  to  be  maximally  effective 
in  controlling  the  degrees  of  freedom  which  are  not  directly  actuated  in  a  super-articulated 
mechanism.  Motivated  by  the  signal  characteristics  seen  in  biological  motor  control,  we  have 
studied  impulsive  inputs  to  control  multiple  pendulum  systems.  While  multiple  pendula 
typically  exhibit  chaotic  responses  to  smooth  periodic  forcing  (e.g.  by  sinusoids),  we  have 
found  that  carefully  designed  impulse  trains  can  be  used  to  elicit  a  variety  of  interesting 
dynamic  responses  in  such  systems.  The  next  chapter  discusses  this  circle  of  ideas  for  a  class 
of  controlled  single  degree  of  freedom  systems. 

Another  goal  of  our  research  on  energy  management  in  autonomous  vehicles  is  to  design  biped 
robots  where  walking  cycle  in  actuated  by  oscillatory  motions  of  an  attached  pendulum. 
Completely  passive  biped  robots  have  been  constructed  (by  other  researchers)  which,  given 
proper  initial  conditions,  walk  without  intervention  down  slightly  sloping  ramps.  The  energy 
lost  at  each  (totally  inelastic)  footfall  is  restored  by  the  conversion  of  gravitational  potential 
energy  to  kinetic  energy.  Our  contribution  to  this  line  of  research  is  to  add  a  controlled 
pendulum-like  appendage  to  the  passive  walker  which  my  be  ’’swung”  in  carefully  designed 
patterns  to  cause  the  biped  to  walk  on  level  surfaces  and  even  up  slight  slopes.  Preliminary 
analysis  of  such  a  mechanism  appears  in  Chapter  3,  below,  and  more  complete  results  will 
appear  in  the  forthcoming  thesis  of  G.W.  Howell. 
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Control  of  Single-Degree-of-Freedom  Hamiltonian  Systems 

with  Impulsive  Inputs 

S.  Weibel  G.  Howell 

Dept,  of  Aero/Mech  Engineering  Dept,  of  ECS  Engineering 
Boston  University  Boston  University 

Boston,  MA  02215  Boston,  MA  02215 

J.  Baillieul 

Dept  of  Aero/Mech  Engineering 
Boston  University 
Boston,  MA  02215 


Abstract 

In  this  paper,  we  develop  a  theory  for  the  feedback  control  of  single  degree-of-freedom  Hamil¬ 
tonian  systems,  utilizing  inputs  which  approximate  6-functions.  The  main  idea  is  that  by  using 
“spikey”  control  inputs,  it  is  possible  to  force  the  system  to  jump  between  energy  levels.  Since 
there  is  a  correspondence  between  energy  levels  and  orbits  for  Hamiltonian  systems,  this  control 
of  system  energy  results  in  control  of  the  system  dynamics.  Impulsive  control  has  proven  useful 
in  simulations  and  experiments  involving  the  global  reconfiguration  of  mechanical  systems.  We 
present  application  of  the  control  to  the  cart  and  pendulum  swing-up  problem. 


1  Introduction 

Recent  and  planned  real-time  control  implementations  in  our  laboratory  (See  [7])  have  led  us  to 
study  the  use  of  intermittent  actuator  pulses  to  control  complex  physical  systems.  While  there  are 
elements  of  modern  digital  control  theory  which  are  useful  in  developing  pulse-modulated  control 
designs,  there  are  several  interesting  issues  which  are  not  fully  addressed  by  existing  theory.  One 
general  question,  which  we  treat  here  in  the  special  case  of  low-order  Hamiltonian  control  systems, 
involves  the  timing  of  pulses.  Motivated  by  results  from  biological  motor  control,  ([5]),  we  find  that 
the  timing  of  actuator  pulses  can  be  effectively  tailored  to  produce  the  maximally  efficient  transmis¬ 
sion  of  energy  through  a  complex  multibody  system.  For  the  systems  studied  below,  pulsed  inputs 
are  most  effectively  applied  when  the  system  passes  near  to  certain  “advantageous”  points  in  the 
phase  space.  At  these  advantageous  points,  which  lie  along  easily  characterized  curves  in  the  phase 
plane,  the  system  is  maximally  able  to  absorb  energy  from  the  pulse.  Feedback  of  a  non-classical 
type  thus  plays  a  role  in  our  control  theory  of  pulse  modulated  systems.  While  a  general  discussion 
of  pulse-shaping  and  pulse-timing  for  motion  control  of  legged  robots,  snake-like  robots,  and  other 
mechanisms  is  beyond  the  scope  of  the  present  paper,  we  do  provide  very  explicit  results  on  optimal 
pulse  timing  for  a  broad  class  of  low-order  mechanical  systems. 

The  paper  is  organized  as  follows.  In  Section  2,  we  describe  the  class  of  single  degree-of-freedom 
(SDOF),  velocity-controlled  Hamiltonian  systems  which  are  the  principal  objects  of  study  in  the 
paper.  The  implementation  of  impulsive  inputs  is  discussed,  and  a  general  formula  for  “optimal” 
application  points  in  the  phase  space  is  given.  (The  objective  is  to  apply  an  impulsive  input  to  add 
or  remove  the  maximum  possible  amount  of  total  energy  from  the  given  degree  of  freedom.)  Section 
3  discusses  control  synthesis,  taking  into  account  the  possibility  of  actuator  saturation.  Section  4 
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applies  the  method  to  the  example  of  swing-up  control  of  a  pendulum  on  a  cart.  Both  damped 
and  undamped  pendulum  dynamics  are  considered,  and  the  results  here  indicate  that  the  idealized 
Hamiltonian  models  are  useful  in  guiding  the  design  of  control  laws  for  mechanical  systems  in  which 
damping  is  present.  We  also  describe  an  actual  experimental  implementation  in  which  there  is  the 
added  complexity  of  a  fairly  short  track  on  which  the  cart  is  allowed  to  travel. 


2  SDOF  Hamiltonian  Systems  and  Impulsive  Inputs 

2.1  Impulse  Response  of  SDOF  Hamiltonian  Systems 

Consider  Hamiltonian  systems  of  the  form 

^  [P  +  mf{q)U (i)]^  +  Va{q,  U)  (1) 

where  Va{^^U)  is  the  augmented  potential  [2] 

Vaiq,U)  =  -^-mUity-  +  Viq) 

and  where  U{t)  =  u(r)dr,  /  G  C^{q),  and  I4  G  C^{q)y  >  2.  Systems  of  this  type  arise  from 
a  reduction  procedure  described  in  [2]  for  a  class  of  velocity-controlled,  single  degree-of-freedom 
mechanical  systems.  The  equations  of  motion  corresponding  to  (1)  are 

4  =  —\p+mf{q)Uit)] 

m 

p  =  ^\p+rnf{q)U{t)]^U{t)-^^  (2) 

We  wish  to  use  u{t)  to  produce  desired  changes  in  (unforced,  i.e.  U{t)  =  u{t)  =  0)  system  energy 

=  (3) 

where  we  have  introduced  the  shorthand  notation  =  H{p,q;0).  From  (1)  and  (2),  the 

instantaneous  change  in  energy  is 

(0)  /(?)]  w(<)  =  F{p,  q,  u)u{t) 

The  change  in  energy  on  a  time  interval  <o  <  ^  is  obtained  by  integrating  dH fdt: 

AH{p,q-,U)=  [  F{p{t),q{t),U{t))u{t)dt  (4) 

Jto 

The  goal  of  this  paper  is  to  show  how  to  prescribe  A  if  by  appropriate  choice  of  w(*).  The  emphasis 
will  be  on  impulsive  inputs,  the  most  basic  model  for  which  is  u{t)  =  S{t  — i')  for  to  <t,f<ti.  With 
this  choice,  the  corresponding  U{-)  in  (4)  is  the  Heaviside  step  function,  and  some  care  is  involved  in 
evaluating  the  product  u{i)U{t).  (See,  e.g.,  [9].)  Since  we  concentrate  on  actual  implementations  in 
which  impulses  are  well  approximated  by  narrow  square  waves,  it  will  be  natural  to  adopt  the  (fairly 
standard)  convention  ([9])  that  the  product  is  given  by  u{t)U{t)  =  —t^).  More  specifically, 

suppose 

r  0  if  f  <  /' 

Uw{t)=  <  ^  if  f  <t  <f -hw  .  (5) 

1^  0  at  >f-\-w 
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Then,  we  take 


Uroit) 


0  i{t<t' 

^  iff  <t 
1  at  >i' w 


If  G{t)  is  any  continuous  function,  and  to  <  t'  <  ti  -  w,  then 


(6) 


l>ti  1  rt’+W 

Git)u^{t)Uy,{t)dt  =  —  G{t){t-i^)dt. 


Elementary  calculus  (rHopital’s  rule)  shows  that  the  limit  as  u;  — ^  0  is  Hence,  the  change  in 

energy  associated  with  the  impulsive  input  u{t)  =  kS{t  —f)  (where  k  is  some  “gain”  coefficient)  is 

=  kpf{q)  +  ^mf{qf. 


Using  the  square- wave  approximation  u^{t)  for  6{t  —f),  we  write 

1  pt*  +  w 

A’‘HUP,q)  =  -  Fipit),qit),UUt))dt.  (7) 

w  Jt' 

We  shall  make  use  of  the  fact  that  limu,^o  A'-.ffu,  =  A^H,  and  more  specifically  we  shall  use  the 
following  theorem. 


Theorem  1  There  exist  numbers,  K  >  0  and  0  <  eo(Pj  9)  •C  1  such  that  for  0  <  e  <  eo, 

\{A’‘H,„lJ-AtH\<Ke 


Proof:  Without  loss  of  generality,  let  t'  —  0.  We  expand  in  a  Taylor  series  around  e  =  0: 


<=o 


r  .  r\  fdF .  dF  .  dF 

F{p,q,Uw)dt  =  €F{po^qo,Uo)-\- 

with  po  =  p(0),  qo  =  g(0),  and  Uq  =  U{0).  By  (2)  and  17(0)  =  0,  we  can  show 

dF  .  dF  .  dF  \ 

where  K  is  an  0(1)  constant.  Therefore 

||A*77,„  -  AlH\\  <-  f  [Kt  +  0{t^)]  dt  <  Ke 
^  Jo 

which  proves  the  theorem.  □ 


t  +  0{t^) 


dt  , 


2.2  Maximal  and  Minimal  Application  Points 

Suppose  we  have  a  system  moving  freely  on  an  orbit  of  energy  Ho-  Denote  the  set  of  points  on 
this  orbit  So  =  {(90, Po)  :  Ho{qo,Po)  =  Hq}-  For  every  point  in  So,  AsH  =  AjF  defines  a  mapping 
AfH  :  So  X  i?  — >■  i2,  where  we  use  the  fact  that  90  and  po  are  related  by  i7o-  It  is  only  natural  to  ask 
which  points  in  So  maximize  or  minimize  AsH.  Extremal  points  {p{qext.  Ho),  9ert))  may  be  shown 
to  satisfy  the  following  equation: 

f{q)^  -  2m^  (Ho  -  V^iq))  =  ±fiq)^V‘^m{Ho-V{q)).  (8) 

We  define  those  points  which  minimize  and  maximize  \AsH\  as  follows. 
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Definition  1  A  point  {p{qexti  ^o),  qext)  €  Do  is  called  a  maximal  application  point  ifqext  maximizes 
l^^^isol*  points  may  be  found  as  solutions  of  equation  (8).) 

Definition  2  A  point  {p{q,  Hq)^  q)  G  Do  is  called  a  minimal  application  point  if  (g)  =  0. 

Remark  1  On  some  orbits  Do,  (q)  is  never  zero.  (This  happens,  for  instance,  if  Do  consists 

of  a  single  equilibrium  point  of  the  unforced  dynamics.)  Hence  minimal  application  points  defined 
in  this  way  may  not  exist  on  some  orbits. 

As  suggested  by  the  presence  of  the  absolute  values  in  the  preceding  definitions,  our  interest  is 
in  the  magnitude  of  the  energy  change  produced  by  an  impulse.  We  shall  see  in  following  sections 
that  knowledge  of  the  minimal  and  maximal  application  points  is  useful  in  control  synthesis. 


3  Control  Synthesis  for  Planar  Hamiltonian  Systems 

3*1  Shaping  the  Control  Input 

The  theorem  of  §2  provides  the  basis  for  control  synthesis  with  impulsive  inputs.  In  control  synthesis, 
our  goal  is  to  design  a  rectangular  pulse  which  produces  a  desired  change  in  energy.  Suppose  we 
choose  our  control  input  u{t)  to  be 


u(t)  =  kr{t)  =  I 


t  G  [t' ,  t^  +  w) 
i  ^  [t'j  t'  4-  u;) 


(9) 


where  ^  is  a  constant  gain  we  will  determine  later,  and  r{t)  is  a  unit  area  rectangular  pulse.  Assuming 
w  is  sufficiently  small,  by  Theorem  1, 


rt*+w 

Jv 


F{p{t),q{t),U{t))r{t)dt  =  +  0{w), 


where  AgH{p(t'),q{t'))  =  kp{t')f{q{f))  +  Then  given  some  desired  energy  state  Hd, 

we  choose  k  such  that 

Hd  -  Ho{p',  q')  =  A\H{p{t'),  q{t'))  +  0{w).  (10) 

where  =  (p(t'),  We  compute  the  ideal  gain  k^  from  the  above  equation:  assuming 

f(q(t'))  zji  0,  let  y  be  the  smaller  of  the  two  roots  of  the  quadratic  equation 

YT) 

Hd  -  Hoip',  q')  =  kp'fiq')  +  —fiq'?-  (11) 

Remark  2  It  is  interesting  that  there  are  generally  two  roots  to  this  “gain  equation.”  The  physical 
reason,  of  course,  is  that  our  input  should  be  thought  of  as  a  velocity,  and  this  velocity  may  be 
applied  in  two  different  directions  with  different  “levels  of  effort”  to  produce  a  desired  AH.  Note 
that  it  might  happen  that  Hd  —  Ho{p\  g')  is  negative  and  of  large  magnitude  such  that  there  are  no 
real  gains  k  satisfying  (11).  We  shall  for  the  moment  assume  this  does  not  happen,  and  we  treat 
this  case  separately  below. 


If  there  are  no  modeling  errors  (i.e.  if  (l)-(2)  describe  the  physical  system  exactly),  then  in  the 
limit  as  ►  0,  the  ideal  gain  k^  transfers  the  system  from  the  initial  energy  state  Hq  to  a  final 
energy  state  Hd  in  one  step.  In  the  non-ideal  case  in  which  pulses  have  finite  non-zero  width  u;, 
we  shall  show  that  under  mild  hypotheses  a  sequence  of  rectangular  pulse  inputs  may  be  defined 
which  steer  the  energy  level  toward  Hd.  A  sufficient  condition  to  be  able  to  define  such  a  sequence 
of  pulses  is  that  the  zeros  of  /(•)  are  disjoint  from  the  critical  points  of  the  potential  function  ¥{•) 
and  do  not  accumulate  at  a  critical  point  of  K(').  Under  this  hypothesis,  we  show  that  the  system 
cannot  get  stuck  at  a  state  (g',p')  where  A^H  =  0.  This  is  the  essential  content  of  the  following 
theorem. 
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Theorem  2  Suppose  there  is  given  a  Hamiltonian  H{p,q;U)  as  in  (1)  with  corresponding  equations 
of  motion  (2).  We  further  suppose  that  /(•)  and  7(-)  are  analytic  functions  and  that  for  some  rjo  >  0 

(i)  For  0  <  T]  <  rjo,  the  set  {q  :  fiq)^  <  f]]  is  composed  of  disjoint,  open,  finite  intervals,  each  of 

length  at  most  €{t])  where  e{r})  =  0{rj)  as  7]  0,  and 

(ii)  {? :  f(q?  <  no}  n  {<? :  |f^(<z)|  <  no)  =  0. 

Lei  Ho  be  some  initial  energy  state  and  Hd  some  desired  final  energy  state.  Letw  >  0  be  given  such 

that  w  1.  Consider  a  sequence,  of  times  to  <  ti  <  t^  <■• .  such  that  (ti  —  U-i)  >  w,  and  define 

the  sequence  of  rectangular  pulses 


Wi(0 


^  i  S  +  zy) 

0  t  £[ti.i+w,ti) 


(12) 


smallest  root  k  of  Hd  —  Hi-i  =  kp'f{q')  + 

0 


if  fWy  >  n 

otherwise. 


(here  (p,_i,gi_i)  =  (p(<.-i),  9(<i-i))  and  =  H{p{ti-i),q{ti-i)).  Then  the  sequence  Hn  = 

HoiPnAn)  Hd  as  n ->-00. 


Proof  Define  for  each  t]  >  0 

=  { iq,p)  :  \pf{q)\  +  <  n]- 


We  claim  that  any  unforced  trajectory  of  (2)  which  enters  or  begins  in  will  leave  after  a 
finite  amount  of  time.  Indeed,  if  (?,?)  €  B^o,  then  f-/(g)^  <  no,  and  under  the  hypothesis  of  the 
theorem,  |f^(g)l  >  »7o-  Hence,  |p|  >  r)o,  and  we  shall  assume,  without  loss  of  generality,  that  p  >  rjo- 
Under  our  hypothesis  on  /(•),  the  set  B^„  is  the  union  of  regions  in  the  phase  plane  of  the  form 
depicted  in  Figure  1.  Unforced  trajectories  of  (2)  evolve  according  to 
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and  if  (9(^0), p(^o))  €  then  either  {q{t),p{i))  leaves  (trajectory  (a)  in  Figure  1)  or  p{t)  be¬ 
comes  positive  (trajectory  (b)).  Ifp(^)  becomes  positive,  it  must  remain  positive  and  bounded  away 
from  zero  as  long  as  (^(^),p(i))  £  Bt^q.  Hence  q{t)  will  be  positive  and  bounded  away  from  zero,  and 
as  long  as  {q{t),p(t))  £  Brj^j  q{t)  will  continue  to  increase.  Since  {q  :  {q,p)  £  Bj^^}  is  contained  in 
the  union  of  a  set  of  subintervals  of  uniformly  bounded  width,  {q{t),p{t))  must  exit  from  Brjo  after 
a  finite  amount  of  time.  This  proves  that  free  (unforced)  motions  of  (2)  do  not  get  stuck  in  regions 
of  the  phase  plane  where  /{q^  <  rjo.  Thus,  the  definition  of  elements  in  the  sequence  of  rectangular 
pulses  prescribed  in  the  statement  of  the  theorem  continues  with  the  gains  not  “getting  stuck”  at 
zero  until  the  system  approaches  the  desired  energy  level. 

Next  we  show  that  with  impulse  gains  defined  as  in  the  statement  of  the  theorem,  the  system 
moves  through  a  sequence  of  energy  levels  Hn,  such  that  Hn  Hd  SiS  n  increases.  The  first  energy 
transition  is  to 

Hi  =  Ho  +  A>^^Hu, 

=  Ho  +  AI^H  +  Oiw)  (by  Theorem  1) 

=  Ho  +  H4-Ho  +  0{w) 

=  Hd  +  Oiw). 

In  the  limit  as  it;  0,  it  would  be  possible  to  make  the  desired  energy  transition  in  a  single  step. 
For  small  iz;  >  0,  the  system  moves  very  nearly  to  the  desired  energy  in  the  first  step.  After  this, 
the  result  of  the  next  rectangular  pulse  is  a  transition  to  the  new  energy  level 

H2  =  Hi  +  A’=^H^. 

Rather  than  invoke  Theorem  1  again  at  this  point,  we  refine  our  estimate  of  A^'^Hu,.  One  can  show 
that  for  ki  defined  as  in  the  statement  of  the  theorem  that 

A’‘*H^=A\'H  +  M  ■{Hd-Hi-i)-w  +  o{kiw),  (13) 

where  M  is  a  constant  which  maybe  chosen  to  be  independent  of  (g£_i,pf_i).  Since  Hd^Hi  =  0{w), 
we  find 

H2  =  Hi+  A]^H  +  M  ■  0{w^)  +  o{kiw) 

=  Hi  +  Hd-Hi  +  0{w^) 

=  Hd  +  0{w^). 

Reapplying  this  type  of  analysis,  it  is  possible  to  conclude  that  Hd  —  Hn  =  0{w^)^  proving  the 
theorem.  □ 


There  are  two  important  ways  in  which  the  rectangular  pulses  we  take  as  our  inputs  must  ac¬ 
tually  differ  from  ^-function  idealizations.  First,  because  time  is  quantized  in  our  digital  hardware, 
there  is  a  minimum  pulse- width  which  can  be  achieved.  For  the  purposes  of  the  present  paper, 
we  shall  eissume,  however,  that  our  pulse  widths  w  may  be  chosen  as  small  as  needed.  There  are 
also  magnitude  constraints  on  the  forces  which  our  actuators  may  produce.  Equation  (9)  does  not 
account  for  possible  limitations  on  the  magnitude  of  k^  jw.  It  is  possible  that  for  some  values  of  k^  ^ 
the  input  k^  jw  may  not  be  physically  realizable  by  the  source  of  actuation.  In  this  case,  a  longer 
sequence  of  rectangular  pulses  Ui{p^q^t)  may  be  required  to  achieve  a  desired  energy  state. 


To  account  for  constraints  on  the  output  magnitude  of  the  system’s  actuator,  we  must  make 
appropriate  modifications  to  the  control  strategy  of  Theorem  2.  Let  the  maximum  allowable  gain 
k  be  denoted  kmax^  and  assume  the  restrictions  on  input  gain  are  symmetric  with  respect  to  sign. 
Then,  the  control  input  it(f )  is  adjusted  to  account  for  this  maximum  value: 


u{t)  = 


isgn  {y)  ■  min  (||^^||,  kmax)  t  €  [<',  t'  +  w) 
0  t  ^  \t' ,  t'  +  w) 
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We  again  control  the  system  by  prescribing  a  sequence  of  rectangular  pulses  applied  at  a  sequence  of 

times  ^0  <  < _ Under  the  hypotheses  of  Theorem  2,  it  is  still  possible  to  use  magnitude-limited 

rectangular  pulses  to  steer  the  system  (2)  between  chosen  energy  levels.  This  is  made  precise  in  the 
next  theorem. 


Theorem  3  Suppose  there  is  given  a  Hamiltonian  H{pjq;  U)  as  in  (1)  with  corresponding  equations 
of  motion  (2).  We  further  suppose  that  /(•)  and  V{-)  are  analytic  functions  and  that  for  some  rjo  >  0 

(i)  For  0  <  Tj  <  T]0f  the  set  {q  :  f{q)^  <  t]}  is  composed  of  disjoint,  open,  finite  intervals,  each  of 

length  at  most  €{7])  where  6(77)  =  0{t])  as  rj  ^  0,  and 

(ii)  {q  :  fiq?  <  r)o}  n  {q  :  ||^(g)|  <  770}  =  0- 

Let  Ho  be  some  initial  energy  state  and  Hd  some  desired  final  energy  state.  Let  w  >  0  be  given  such 
that  w  1.  Consider  a  sequence  of  times  to  <  <  ^2  <  •  •  •  such  that  {U  —  U^i)  >  w,  and  define 
the  sequence  of  rectangular  pulses 


Ui 


min 

0 


milkrna.) 


i  E  [ii^i,ti-i  +u;) 
i  E  [tt~i  +  u;,  tf) 


(14) 


where  kj  is  sequence  of  ideal  gains  defined  in  theorem  2.  This  sequence  of  rectangular  pulse  inputs 
steers  the  system  (2)  through  a  sequence  of  energy  levels  Hi  such  that  as  i  increases  Hi  Hd^ 


Proof  [Sketch.)  Suppose  pulse  magnitudes  are  bounded  by  kmax^  The  change  in  energy  produced 
by  applying  the  rectangular  pulse  (of  amplitude  k  and  width  i/;) 


u 


k 

0 


t  E  [t^  t'  +  w) 
t  ^  +  w) 


to  (2)  is 


A^H^}  =  kwpf{q)  +  +  0(1:;). 


Essentially  the  same  argument  that  was  applied  to  prove  theorem  2  establishes  the  conclusion  of  the 
present  theorem.  Of  course,  convergence  is  slower  by  a  factor  of  0(1 /w).  □ 


Remark  3  The  two  theorems  proved  in  this  section  show  that  very  simple  pulse  modulated  control 
designs  may  be  applied  under  mild  assumptions  to  steer  a  single  degree  of  freedom  Hamiltonian 
system  of  the  form  (2)  between  desired  energy  curves  in  the  phase  plane.  By  combining  such 
pulse-modulated  steering  with  the  free  evolution  of  the  dynamics  along  constant  energy  curves,  it  is 
possible  to  steer  the  system  between  states  which  may  be  of  interest.  While  this  type  of  control  has 
been  shown  to  work  even  in  the  presence  of  constraining  amplitude  bounds  on  the  inputs,  it  remains 
to  show  how  effectively  pulse  modulated  control  may  be  applied  in  situations  where  speed  and  input 
energy  are  of  concern.  In  the  next  section,  we  address  the  question  of  optimal  timing  of  pulses. 


4  Impulsive  Control  of  the  Cart  and  Pendulum 

4.1  System  Model 

In  this  section,  we  consider  the  problem  of  controlling  the  cart  and  pendulum,  as  shown  in  Figure 
2.  In  the  figure,  0  represents  the  angular  displacement  of  the  pendulum  from  vertical,  and  qo  is 
the  position  of  the  cart.  M  and  m  are  the  masses  of  the  cart  and  pendulum,  respectively,  i  is  the 
length  of  the  pendulum,  and  g  is  the  gravitational  acceleration.  The  system  in  Figure  2  has  the 
Hamiltonian 

Hip,  q;  U)  =  \p-mtcosqU  (<)]^  -  +  mglil  -  cos  q)  (15) 
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Figure  2:  The  Cart  and  Pendulum 


Figure  3:  Maximal  and  minimal  application  points  for  the  cart  and  pendulum,  where  m  =  ^  ^  =  1. 

Note  that  a  minimal  application  point  does  not  exist  for  Hq  <  0.13. 


where  q  =  0  and  corresponding  equations  of  motion 

p  =  —j\p—m£cosqU{t)]smqU{t)  +  mgismq. 


(16) 

(17) 


4.2  Effects  of  Input  Shape  on  Change  in  Energy 

From  (15),  we  have 

F{pAyU)  -  -  Q(P“-  mfcosg ;/(<)) cos g  +  m;7(<)^ 


If  u{t)  =:  6{i  —  then 

A$H  =  -^(K^O  mi  cos  q{t'))  cos  q{t') 

From  (8),  the  maximal  application  points  are  those  points  (p,  q)  E  So  which  satisfy 

—  sin  5  cos  g  —  posing  =  psingcosg,  (18) 

where  p  =  p(g,/7o)  =  ±-^2(770  “  (i  *“  cos^)).  The  minimal  application  points  are  simply  those 
points  (p,  g)  E  So  for  which  AH$  ~  0.  Plots  obtained  numerically  for  the  maximal  and  minimal 
application  points  are  shown  in  Figures  3a, b,  where  m,  g,  and  £  were  all  taken  to  be  1.  Note  in 
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Figure  4:  (a)  Selected  Constant  Energy  Curves  and  (b)  Corresponding  AHyj  for  the  Cart  and 
Pendulum 


Figure  3b  that  a  minimal  application  point  does  not  exist  for  Hq  <  0.13.  As  shall  be  seen  in  the 
next  figure,  this  is  because  the  AHs  curve  lies  entirely  above  the  g-axis  for  energy  levels  smaller 
than  0.13.  The  number  of  solutions  for  each  changes  at  the  energy  Hq  =  where  the  orbit  Eq 
passes  through  (q^p)  =  (ib7r/2,0).  Nonlinear  controllability  analysis[10]  reveals  that  the  system  is 
neither  locally  nor  globally  accessible  at  these  states.  In  Figure  3a,  two  new  branches  of  solutions 
are  created,  both  of  which  terminate  at  ^  =  ±7r  within  the  plotted  range  of  Hq.  Another  branch 
of  solutions  at  g  =  ±7r  forms  for  Hq  >  2.  The  solution  corresponding  to  Hq  =  2  is  the  separatrix 
solution,  cLS  shall  be  seen  in  the  next  set  of  figures.  The  two  “bifurcated”  branches  of  solutions  in 
Figure  3a  ultimately  limit  on  the  q  =  ±7r  solutions.  In  Figure  3b,  a  new  branch  q  =  ±7r/2  is  created 
at  iJo  =  1)  and  exists  for  all  Hq.  Therefore,  an  input  applied  at  ^  =  ±7r/2  for  >  1  produces 
no  change  in  energy.  The  original  branch  of  solutions  in  Figure  3b  also  limits  at  g  =  ±7r,  but  does 
not  persist.  Like  q  =  ±7r/2  a,t  Hq  =  1,  the  value  of  Hq  where  q  =  ±7r  is  both  a  maximal  and 
minimal  application  point.  These  degeneracies  are  the  result  of  the  presence  of  cusps  in  AHs  at  the 
degenerate  points. 

Results  of  numerical  investigations  of  AH^^j  for  varying  and  w  are  shown  in  Figures  4  and  5. 
Figure  4a  shows  various  orbits  of  the  unforced,  undamped  pendulum  for  different  energies  Hq.  The 
images  of  the  orbits  in  Figure  4a  are  shown  in  Figure  4b.  The  curves  in  Figure  4b  were  not  deter¬ 
mined  by  direct  computation  of  ?o)-  Rather,  they  were  computed  by  running  simulations 

of  the  pendulum  system  with  a  pulse  applied  at  (pO)?o)j  and  then  measuring  the  change  in  energy. 
In  the  simulations,  parameter  values  m  =  g  =  £  =  I  and  w  =  0.01  were  used,  with  a  time  step  of 
0.001.  These  experimental  curves  closely  match  curves  obtained  from  AHs^  and  confirm  our  earlier 
observations  on  maximal  and  minimal  application  points. 

Figure  5  shows  the  effect  of  variations  in  given  initial  conditions  with  energy  Hq.  Parameter 
values  used  in  simulations  are  listed  in  the  following  table: 


Pulse  width  w 

0.01 

0.1 

0.5 

Mass  m 

1 

1 

1 

Gravity  g 

1 

1 

1 

Initial  energy  Hq 

1 

1 

1 

Integration  time  step 

0.001 

0.01 

0.05 

The  effect  of  increasing  w  is  to  shift  the  values  to  the  right  above  the  g-axis,  and  to  the  left 
below.  It  is  also  clear  that  the  shifts  of  A/fo.oi  and  AHq,i  were  accurately  predicted  by  Theorem  1. 
Note  that  the  curves  for  AH^  that  lie  above  the  x-axis  are  for  negative  values  of  p,  and  similarly 
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Figure  5:  AHw  vs.  q  for  the  Cart  and  Pendulum,  J^o  =  1 


below  the  g-axis  for  positive  p.  The  pulses  used  in  these  calculations  were  positive,  meaning  the  input 
was  a  positive  acceleration  of  the  cart.  We  see  that  the  pendulum  experiences  an  increase  in  energy 
when  the  cart  accelerates  opposite  the  direction  of  the  motion  of  the  pendulum,  and  a  decrease  when 
the  cart  accelerates  in  the  same  direction.  This  may  be  interpreted  cis  saying  that  the  acceleration 
the  pendulum  feels  is  due  to  the  change  in  relative  velocities  of  the  cart  and  pendulum.  Also  clear 
from  the  plot  is  a  decrease  in  the  maximum  change  in  energy  with  increasing  w,  as  evidenced  in  the 
plot  of  A/fo.5- 


4.3  Stabilization  of  the  Inverted  Equilibrium  -  Simulation  Results 

We  now  simulate  the  controlled  stabilization  of  the  inverted  equilibrium  of  the  pendulum.  Because 
any  lab  experiment  would  require  us  to  account  for  damping,  we  also  consider  impulsive  control  of 
the  damped  cart  and  pendulum  under  the  assumption  that  damping  is  small.  Given  that  our  control 
synthesis  is  based  on  precise  knowledge  of  the  separatrix  energies  of  the  system,  we  would  expect 
that  impulsive  control  would  fail  in  the  presence  of  damping.  We  shall  see  that,  in  the  absence  of 
any  compensation  for  damping,  impulsive  control  does  fail  in  the  damped  case.  It  performs  well 
enough,  however,  that  a  hybrid  control  scheme  can  be  developed  which  features  the  simplicity  of 
impulsive  control  for  swing^up  and  standard  linear  control  for  stabilization. 

The  results  of  the  simulations  are  depicted  in  Figures  6  and  7.  Parameter  values  used  in  the 
simulations  are  listed  in  the  following  table: 


Parameter 

Simulation  1 

Simulation  2 

Mass  m 

1 

1 

Gravity  g 

1 

1 

Damping  coefficient  c 

0 

0.05 

Gain  Gp 

5 

5 

Initial  po 

0 

0 

Initial  go 

0 

0 

Desired  energy  Hd 

2 

2 

Pulse  width  w 

0.01 

0.01 

Duty  cycle  {ti  —  ti-i) 

0.25 

0.25 

Integration  time  step 

0.001 

0.001 
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Figure  6:  Phase  Portraits  for  Undamped  and  Damped  Systems  with  Impulsive  Control 


Figure  7:  u{t)  vs.  f  for  the  Undamped  and  Damped  Systems 


Both  the  undamped  and  damped  systems  move  near  the  desired  energy  within  a  small  number  of 
iterations  (in  this  case,  four),  as  predicted.  The  undamped  system  continues  to  move  towards  the 
separatrix,  and  stabilizes  at  (— tt,  0).  The  rapidity  of  convergence  is  seen  in  Figure  7,  where  input 
magnitude  has  fallen  to  less  than  0.01  within  three  integration  seconds.  The  damped  system  never 
converges  to  the  desired  energy.  As  seen  in  the  figure,  input  magnitudes  fall  off  quickly  as  the  system 
approaches  the  separatrix,  but  oscillate  about  u{t)  =  0  with  small  magnitude  for  the  remainder  of 
the  simulation.  However,  the  trajectory  does  pass  close  enough  to  the  unstable  fixed  points  to  switch 
to  linear  control. 


4.4  Stabilization  of  the  Inverted  Equilibrium  -  Experimental  Results 

Recent  experiments  in  the  Boston  University  Robotics  Lab  have  verified  that  impulsive  controls 
may  be  used  in  conjunction  with  linear  control  to  regulate  swing-up  and  stabilize  the  cart  and 
pendulum  experiment  at  the  inverted  equilibrium.  A  photograph  of  the  experimental  apparatus  and 
a  schematic  of  the  system  appear  in  Figure  8.  The  effective  pendulum  length,  when  it  is  modelled 
as  a  point  mass  pendulum,  is  0.25  meters. 

4.4,1  Control  Synthesis 

The  control  we  choose  for  the  pendulum  swing-up  maneuver  involves  applying  impulses  at  the  max¬ 
imal  application  point  of  q{ti)  =  0  radians.  The  impulse  application  times  ti  are  determined  as 
the  experiment  progresses.  Each  of  these  energy-pumping  impulses  increases  the  energy  of  both  the 
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Figure  8:  Cart  with  pendulum  apparatus:  (left)  photograph  and  (right)  schematic. 


pendulum  and  the  cart.  The  finite  track  length  for  the  experimental  apparatus  shown  in  Figure  8, 
however,  imposes  restrictions  on  the  shape  of  the  control  input.  After  each  energy-pumping  impulse, 
a  counter-input  must  be  applied  to  halt  the  cart  before  it  crashes  into  its  limit  stop.  To  achieve 
the  goal  of  inverting  and  balancing  the  pendulum,  the  counter-input  must  be  designed  such  that  it 
does  not  remove  more  energy  from  the  pendulum  than  was  provided  by  the  energy-pumping  impulse. 

We  choose  for  the  counter-input  a  low-amplitude  pulse  of  relatively  long  duration.  The  area  of 
the  counter-pulse  is  set  to  be  equal  to  that  of  the  energy-pumping  impulse  so  that  the  counter-pulse 
will  have  an  equal  but  opposite  effect  on  the  cart  dynamics.  The  long  duration  of  the  counter-pulse 
ensures  that  the  pendulum  is  farther  away  from  the  maximal  application  point  during  application 
than  it  was  for  the  energy-pumping  impulse.  This  causes  it  to  have  less  of  an  effect  on  the  pendulum 
energy.  The  net  effect  of  each  impulse/counter-pulse  pair  is  to  increase  the  pendulum  energy. 

After  the  impulsive  input  sequence  has  inverted  the  pendulum,  a  linear  regulator  is  switched  on 
to  stabilize  the  system  about  its  inverted  position.  The  details  for  the  design  of  such  a  regulator 
may  be  found  in  [8]. 

4.4.2  Experimental  Results 

The  results  of  the  control  algorithm  may  be  seen  in  the  pendulum  phase  plot  of  Figure  9.  The  initial 
condition  of  the  pendulum  is  {q,q)  =  (-0.30,0.00).  The  pendulum  velocity  data  is  noisy  because 
it  was  calculated  from  position  encoder  readings.  Despite  the  noise,  the  increase  in  Hamiltonian 
energy  due  to  the  impulsive  inputs  may  be  seen  as  an  increase  in  radius  each  time  the  angle  of  the 
pendulum  crosses  q  =  0  radians. 

The  corresponding  control  input  shown  in  Figure  10a  is  the  acceleration  of  the  cart.  As  noted 
in  Theorem  1,  square- wave  approximations  to  the  delta  function  have  an  upper  bound  on  their 
duration,  w.  To  focus  the  effect  of  the  input  near  the  maximal  application  point,  the  width  of  each 
impulse  is  set  to  be  inversely  proportional  to  the  pendulum  velocity,  q{ii).  The  impulses  in  Figure 
10a  may  be  seen  getting  taller  and  narrower  as  the  experiment  progresses.  Immediately  following 
each  impulse  in  this  graph  may  be  seen  the  related  low-amplitude  counter-pulse. 

Figure  10b,  which  displays  the  cart  velocity,  shows  that  the  counter-pulses  indeed  have  an  oppo¬ 
site  effect  on  the  cart  energy  by  driving  the  velocity  back  down  to  zero.  The  abrupt  change  in  cart 
velocity  at  ^  =  4.75  seconds  reflects  the  action  of  the  linear  control  law  as  the  pendulum  enters  the 
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Figure  9:  Pendulum  phase  plot  showing  effect  of  impulsive  inputs  and  stabilization  at  the  inverted 
equilibrium.  Following  the  path  of  the  pendulum  in  its  clockwise  spiral  from  the  origin,  an  increase 
in  radius  (and  hence,  energy)  is  evident  each  time  the  pendulum  received  an  impulsive  input  as  it 
crossed  ^  =  0  radians.  A  linear  regulator  stabilized  the  pendulum  about  its  inverted  equilibrium 
when  the  pendulum  entered  its  region  of  stability  at  (-7r,0). 


region  of  stability  of  the  linear  regulator.  Figure  10c  shows  the  position  of  the  cart  as  a  function 
of  time.  Note  that  the  entire  swing-up  maneuver  required  a  total  of  less  than  10  cm  of  cart  track 
length. 

Experimental  results  on  controlling  a  swing-up  maneuver  and  balancing  the  pendulum-on-a-cart 
system  have  so  far  been  very  encouraging.  In  preparing  the  full  conference  version  of  this  paper,  it 
is  planned  to  provide  comparisons  with  other  approaches  to  swing-up  and  balancing  control.  (Cf. 
[4].)  Short  of  this  at  the  present  time,  we  note  two  advantages  of  our  impulsive  control  strategy:  (i) 
The  implementation  required  very  few  details  about  the  physical  characteristics  of  the  pendulum- 
on-a-cart  system;  no  effort  to  identify  physical  parameters  was  involved,  (ii)  As  Theorem  2  indicates 
and  experiments  confirm,  magnitude  limits  on  the  impulsive  inputs  have  little  effect  on  the  system’s 
ability  to  perform  the  swing-up. 


5  Conclusion 

This  paper  has  presented  preliminary  results  of  a  research  program  aimed  at  developing  a  theory 
of  pulse-modulated  control  of  mechanical  systems.  While  our  aim  has  been  to  develop  a  theory 
which  can  be  applied  to  a  wide  class  of  mechanical  systems  (See  [1]!),  the  emphasis  here  has  been  on 
velocity-controlled,  single  degree  of  freedom  Hamiltonian  systems.  We  have  introduced  an  energy- 
based  global  control  method  for  such  systems  based  on  inputs  which  approximate  6-functions.  In 
§2,  we  characterized  the  response  of  Hamiltonian  systems  to  impulsive  inputs.  Using  the  change 
of  energy  produced  by  a  6-function  input  as  a  start  for  our  analysis,  we  derived  an  estimate  of 
the  change  in  energy  produced  by  a  narrow  rectangular  pulse.  By  viewing  our  “change  in  energy 
function”  AHv,  as  an  operator  on  orbits  of  the  unforced  system,  we  derived  expressions  defining 
those  points  on  the  orbit  at  which  an  impulsive  input  would  produce  the  meiximum  and  minimum 
changes  in  energy.  This  information  was  used  in  §3  as  a  basis  for  the  synthesis  of  control  laws.  The 
proposed  control  law  consists  of  a  sequence  of  rectangular  pulses  which  produce  a  desired  net  change 
in  energy.  Under  some  mild  cissumptions,  we  showed  that  a  finite  number  of  appropriately-sized  in¬ 
puts  produces  the  desired  finite  change  in  energy.  The  theory  of  §2  and  §3  was  applied  to  the  cart 
and  pendulum  in  §4.  In  simulations  and  laboratory  experiments,  our  simple  control  law  performed 
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Figure  10:  (a)  Acceleration,  (b)  velocity,  and  (c)  position  of  the  cart  vs.  time  for  swing-up  of 
pendulum. 


pendulum  swing-up,  and  in  the  simulations  was  able  to  steer  the  system  to  the  stable  manifold  of  the 
inverted  equilibrium.  In  the  experiments,  impulsive  control  steered  the  system  into  a  neighborhood 
of  the  inverted  equilibrium,  where  a  linear  controller  performed  the  final  stabilization. 

What  makes  this  method  of  control  attractive  in  these  types  of  problems  is  its  simplicity,  which 
is  best  illustrated  in  the  cart  and  pendulum  application.  In  future  publications,  we  will  compare 
the  performance  of  our  swing-up  controller  to  other  control  methods.  One  of  our  goals  is  to  extend 
the  analysis  presented  here  to  n-DOF  Hamiltonian  systems.  Because  of  the  nontrivial  dynamics  of 
multi-degree-of-freedom,  coupled  systems  like  kinematic  chains,  it  is  not  completely  clear  how  the 
theory  presented  here  extends  in  the  n-DOF  case.  We  have  found,  however,  that  it  is  possible  to 
extend  the  theory  (with  some  modification)  under  certain  conditions.  This  work  has  been  shown 
to  be  useful  in  rigid  body  reorientation.  We  shall  present  this  analysis  in  subsequent  publications. 
Finally,  current  work  has  shown  that  impulsive  controls  can  be  used  in  conjunction  with  open- 
loop,  “structure-creating”  controls,  such  as  high  frequency  periodic  inputs  (See  [2], [3], [6],  and  [11].)) 
to  form  closed-loop  hybrid  controllers.  In  this  scenario,  the  impulsive  control  steers  the  system 
around  the  phase  space  structure  created  by  the  open-loop  forcing.  Details  on  this  work  will  appear 
elsewhere. 
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Figure  1:  Pendulum- driven  rimless  wheel 
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Abstract  -  Tad  McGeer  had  encouraging  success  in  studying  a  rimless  wheel  rolling  down  a  slope  as  a 
precursor  to  studying  passive  bipedal  walking  down  a  slope.  In  an  effort  to  understand  the  method  by  which 
an  otherwise  passive  walking  robot  might  be  controlled  by  a  motorized  pendulum  mounted  to  its  hip,  we 
present  here  a  study  of  a  pendulum-driven  rimless  wheel,  where  a  motor  holds  the  pendulum  at  a  constant 
angle  with  respect  to  world  coordinates.  This  document  describes  the  derivation  of  the  system  dynamics 
equation.  Part  of  the  dynamics  is  a  momentum-conserving  collision  at  each  successive  foot  fall.  This  collision 
equation  describes  the  transition  of  the  system  into  the  next  step.  Also  described  is  a  method  of  finding  the 
steady-state  trajectory  through  iterative  simulation.  Simulation  data  for  a  sample  system  is  presented  as  an 
aid  to  the  reader’s  tuition. 


1  INTRODUCTION 

The  first  task  performed  by  Tad  McGeer  in  developing  his  passive  biped  robot  was  to  study  the  passive 
dynamics  of  a  rimless  wheel  rolling  down  a  slope  [3].  From  his  study  he  found  average  wheel  speeds  for 
varying  values  of  slope  angle,  wheel  spoke  separation,  and  wheel  moment  of  inertia.  This  data  helped  him 
select  physical  parameters  for  his  biped  walker  such  that  the  leg  swing  period  would  match  the  speed  of  the 
walker.  A  similar  procedure  will  allow  us  to  design  a  pendulum-driven  biped  walker. 

Figure  1  shows  the  pendulum-driven  rimless  wheel  system.  The  rimless  wheel  is  unactuated  at  the 
ground  level  and  is  free  to  rotate  about  the  stance  spoke  without  slipping.  A  motor  connecting  the  pendulum 
to  the  wheel  may,  in  general,  force  the  pendulum  to  follow  an  arbitrarily  specified  trajectory.  For  this  first 
analysis,  however,  the  motor  is  used  to  maintain  the  pendulum  at  a  constant  angle,  k,  with  respect  to  the 
horizontal.  Figure  1  shows  the  wheel  rotating  about  the  current  stance  spoke,  during  which  time  its  behavior 


is  governed  by  the  equation  of  motion  (derived  in  section  2).  At  the  instant  the  next  spoke  hits  the  ground, 
however,  the  impulsive  collision  will  instantaneously  change  the  speed  of  the  wheel.  To  keep  the  pendulum 
at  the  constant  angle  /c,  the  motor  must  simultaneously  apply  an  impulse  to  adjust  the  pendulum  speed  to 
match  the  new  wheel  speed.  The  collision  equation  is  derived  in  section  3. 

Section  4  describes  the  iterative  simulation  method  of  finding  the  steady  state  trajectory  and  features 
some  simulation  results  for  a  sample  rimless  wheel  system. 


2  DYNAMICS  EQUATIONS 

The  equations  of  motion  for  the  pendulum-driven  rimless  wheel  are  generated  by  calculating  the  Lagrangian 
for  the  system,  then  taking  the  proper  derivatives  to  form  Lagrange’s  Equations.  This  section  reports,  briefly, 
on  the  details  of  finding  the  equations  of  motion. 

Figure  2  shows  the  vectors  and  physical  parameters  needed  to  derive  the  equations  of  motion  for  the 
pendulum-driven  wheel  system.  All  spokes  except  for  the  stance  spoke  and  the  collision  spoke  have  been 


Figure  2:  Parameters  for  the  pendulum-driven  wheel 


omitted  for  clarity.  The  wheel  has  mass  m  and  moment  of  inertia  I.  The  angle  between  two  consecutive 
spokes  of  the  rimless  wheel  is  2ao.  The  pendulum  is  a  point  mass  with  mass  km'^  at  the  end  of  a  massless 
pendulum  shaft  of  length  The  vector  rn  points  to  the  center  of  mass  of  the  wheel  with  respect  to  the 
stance  contact  point.  Similarly,  the  vector  fi2  points  to  the  tip  mass  of  the  pendulum  bob  with  respect  to 
the  stance  contact  point.  The  vectors  are  given  by 

Ui  =  (sinji  i  +  icosji  k 

fi2  =  [^sin7i  -{-  kiisin  (71  +  72)]  i  +  cos 71  +  keicos  (71  +  72)]  k 

The  velocities  of  the  wheel  center  of  mass,  Fn,  and  the  pendulum  tip  mass,  Vu,  may  be  found  by  taking  the 
time  derivative  of  rn  and  fi2,  respectively. 

The  kinetic  energy  of  the  wheel  is  given  by 

For  simplification  purposes,  we  define  the  radius  of  gyration  of  the  wheel,  Vgyr,  to  satisfy  the  equation 
I  =z  mPr^y^.  Then  the  kinetic  energy  for  the  wheel  becomes 

Tvjheel  = 


(1) 


The  potential  energy  of  the  wheel  is  due  to  the  force  of  gravity  and  is  given  by 


^wheel  — 


(2) 


where  Vn^  is  the  vertical  component  of  the  rn  position  vector.  In  the  same  fashion,  the  kinetic  and  potential 
energies  of  the  pendulum  may  be  found  to  be 

Tpend  = 

Vpend  ^  ^771^5^^12, 


The  Lagrangian,  £,  for  the  system  is  given  by  the  sum  of  all  the  kinetic  energies  minus  the  sum  of  all 
the  potential  energies: 

Since  there  is  no  form  of  actuation  between  the  ground  and  the  stance  leg,  while  a  motor  with  torque  r 
actuates  the  pendulum  with  respect  to  the  wheel,  Lagrange’s  equations  for  the  system  are 

d  dC  dC 

d  dC  dC 

In  this  work,  it  is  assumed  that  the  motor  is  driven  such  that  the  pendulum  maintains  a  constant  angle, 
AC,  with  respect  to  the  horizontal.  From  figure  2  we  see  that  the  relation  between  /c,  71,  and  72  is 

72  =  I  -  71  + 


=  0  (3) 

=  (4) 


For  the  pendulum  to  remain  motionless,  further,  ^  Plugging  these  relations  into  the  first  equation 

of  motion,  equation  3,  we  find  that  the  acceleration  of  the  wheel  is 


d^7i  _  fcmfe^cos(7i  -  4-  f(l  +  fcni)sin7i  -  fkmkicosK 

dt^  1  +  -  kmh  sin  (71  -  k) 

To  simplify  the  equations,  a  new  time  variable  is  introduced: 

^new  ~  ^ 

Letting  “dots”  denote  differentiation  with  respect  to  tnewy  equation  5  becomes 

..  _  kjnh  cos  (71  -Ac)7i^-f  (l  +  fem)  Siu  7i  -kmke  cos  AC 
'>'1  ~  l+km+rly^-kmkesin{ji-K) 


(5) 


(6) 


Note  that  equation  6  is  independent  of  the  wheel  mass,  gravitational  acceleration,  and  leg  length.  The 
manner  in  which  the  variables  have  been  chosen  requires  that  just  five  parameters  be  specified  to  completely 
define  the  pendulum-driven  system:  (ao,  km^h,  k). 


3  COLLISION 

When  the  next  spoke  of  the  rimless  wheel  collides  with  the  ground,  the  system  will  instantaneously  lose 
speed.  The  following  analysis  of  the  system  collision  follows  one  outlined  by  Hurmuzlu  and  Chang  for 
planar  kinematic  chains  [2].  It  is  based  upon  the  laws  of  conservation  of  angular  impulse  and  momentum. 


conservation  of  linear  impulse  and  momentum,  an  assumption  on  the  elasticity  of  the  collision,  the  frictional 
relationship  between  the  horizontal  and  vertical  components  of  the  impulse  delivered  to  the  wheel  by  the 
ground,  and  the  desired  relationship  between  the  final  speeds  of  the  wheel  and  the  pendulum. 

Figure  3  shows  the  three  phases  of  the  collision  which  must  be  considered.  Figure  3a  shows  the  wheel  an 
instant  before  collision.  The  wheel  is  rotating  about  the  left  spoke  with  known  speed  71  while  the  pendulum 
is  rotating  about  the  hub  with  known  speed  72.  The  right  spoke  is  shown  as  a  dotted  line  to  emphasize  that 
it  is  not  yet  supporting  the  weight  of  the  wheel.  At  the  moment  of  impact,  shown  in  figure  3b,  the  ground 


a)  Before  Collision  b)  Collision  c)  After  Collision 

Figure  3:  Parameters  of  interest  during  collision 

under  the  collision  foot  applies  an  impulsive  force  with  unknown  components  Fx  and  Fz^  To  maintain  a 
desired  relationship  between  the  angular  speeds  of  the  wheel  and  pendulum,  the  motor  applies  an  impulsive 
torque  with  unknown  magnitude  r.  For  simplicity,  let  these  impulsive  forces  be  represented  by  Dirac  delta 
functions:  F{t)  =  F  S{t)^  r{t)  =  r  S{t).  After  the  collision,  figure  3c,  the  stance  leg  lifts  off  the  ground  with 
unknown  speed  components  dX  and  dZ,  while  the  wheel  and  pendulum  have  two  new  unknown  rotational 
speeds,  and  72-  The  assumptions  in  this  collision  analysis  are  1)  the  impact  is  instantaneous  point  impact, 
2)  the  geometry  of  the  system  is  unchanged  during  impact,  and  3)  all  non-impulsive  forces  (due  to  gravity 
or  centripetal  acceleration)  are  small  compared  to  the  impulses  and  may  be  neglected.  Note  that  all  primed 
(')  values  in  this  analysis  represent  after-collision  values. 

We  must  solve  for  a  vector  with  seven  unknowns: 

^  =  i'yi>i2>dX,dZ,Fx,Fz,T)'^ 

The  only  two  known  quantities  we  have  are  the  speeds  of  the  wheel  and  pendulum  prior  to  impact:  7  = 
(71,72)-  We  may  construct  the  matrix  equation  Au  =  with  knowledge  of  the  following  conservation  laws 
and  relations. 

Angular  Impulse  and  Momentum  -  The  principle  of  impulse  and  momentum  states  that  the  angular 
momentum  of  a  system  is  only  changed  by  the  application  of  an  external  torque: 

L  +  Te^tdt  =  L'.  (7) 

In  equation  7,  L  is  the  angular  momentum  of  the  system  at  time  0,  Text  Is  a  torque  external  to  the  system 
which  is  applied  between  times  0  and  and  L'  is  the  angular  momentum  of  the  system  at  time  t' , 

We  have  two  separate  angular  momentum  equations  we  may  consider  for  the  pendulum-driven  rimless 
wheel.  The  first  equation  involves  the  angular  momentum  of  the  system  about  the  stance  contact  point,  Lg. 
[We  could  have  chosen  any  arbitrary  point  to  calculate  the  angular  momentum  about,  but  this  choice  of  axis 
yields  relatively  simple  results.]  This  angular  momentum,  before  impact,  is  given  as: 

Ls  =  mPrly^'yi  +  m  (fn  x  tTn)  +  k^m  (ri2  x  1712) 


At  impact  the  torques  from  the  impulsive  forces  and  torque  shown  in  figure  3b  are  added  as  provided  by 
equation  7.  Note  that,  as  impulses,  they  require  no  further  integration.  The  torque  contribution  from 
the  ground  forces  is  found  by:  Torque  =  (r  x  F).  The  contribution  from  ground  force  is  zero  since 
the  direction  of  this  force  vector  goes  through  the  axis  of  rotation.  The  motor  torque  impulse,  f,  doesn’t 
affect  the  angular  momentum  of  the  system  because  its  torque  is  internal  to  the  system.  Hence  the  total 
external  torque  impulse  felt  by  the  system  (about  the  stance  point  axis)  is  given  by  (“2/^2  sin  71) 5(i).  The 
conservation  of  angular  momentum  equation  (7)  becomes 

Ls  +  J {—2lFySmji)6{t)dt  =  which  we  write  out  explicitly  as 

ml^rlyr'yi  -h  m  (fu  x  iTu)  +  kmm  (fi2  x  V12)  -  2lFy  sin  71  =  ml'^  r^yr^'i  +  m  (fn  x  vn')  +  kmm  (fi2  x  V12) 

(8) 

where  simple  vector  analysis  shows  that 


^11'  =  +  dXi  +  dZ  k, 

Vii  =  +  dXi  +  dZ  fc, 

^^new 


and  the  rotational  speeds  in  the  v'  velocities  are  the  ones  after  impact  (i.e.  (71,72))- 

The  second  equation  for  the  collision  analysis  involves  the  principle  of  angular  impulse  and  momentum 
(equation  7)  of  just  the  pendulum  about  the  wheel  hub.  The  pendulum’s  angular  momentum  just  before 
impact  is: 


Lp  =  kmm  ((fi2  -  5i)  X  V12) 

The  only  external  torque  contribution  which  the  pendulum  feels  (since  the  ground  forces  act  through  a  spoke 
and,  hence,  through  the  pendulum’s  axis  of  rotation)  is  the  motor  torque,  r.  The  conservation  equation  (7) 
thus  becomes 


Lp  -f 


kjnm  ((fi2  -  fii)  X  V12)  +  r 


l; 

kmm  ((fi2  -  rii)  X  V12) 


(9) 


Linear  Impulse  and  Momentum  -  The  principle  of  linear  impulse  and  momentum  contributes  two 
additional  equations.  The  first  equation  obtained  from  this  principle  states  that  the  total  horizontal  linear 
momentum  of  the  system  may  only  be  modified  by  forces  in  the  horizontal  direction.  Given  that  the  linear 
momentum  in  the  horizontal  direction  before  and  after  collision  is  given  by 


Ps,  =  invii^ -]rk,nmvn,,  and 
Ps^  =  mvii^  +  kmmvuj, 


the  conservation  of  horizontal  linear  impulse  and  momentum  equation  is 

Ps.  + 1  Fj{t)dt  -  Ps: 
mvii^+kmmvi2^+Fx  =  mvn^  +  kmmvn^ 


(10) 


Similarly,  another  equation  comes  from  consideration  of  the  linear  momentum  of  the  system  in  the 
vertical  direction: 

Ps.  +  J  F.J{t)dt  =  Ps' 

mvii^  +  kmmvi2.  ■¥  Fz  = 


P  ' 

Sr 

mvn^  +  kmm  V12.' 


(11) 


Coefficient  of  Restitution  -  The  fifth  system  equation  is  the  coefficient  of  restitution  equation-a  general 
collision  equation.  The  coefficient  of  restitution  is  a  representation  of  how  much  “bounce”  there  is  in  the 
system.  In  general,  the  restitution  equation  takes  the  form: 

VBj  +  evB,=0,  (12) 

where  vb^  is  the  vertical  velocity  of  the  collision  spoke  w.r.t.  an  inertial  frame.  The  assumption  we  will 
make  for  the  rimless  wheel  is  that  the  ground  collisions  are  completely  inelastic.  Hence  the  coefficient  of 
restitution,  e,  will  be  equal  to  zero  and  the  restitution  equation  will  become: 

vbJ  =  0 


Resolving  the  Impulse  Vector  -  The  sixth  system  equation  merely  relates  the  two  impulse  vectors  to 
each  other  by  an  unknown  factor  /i: 

~  V  '  (13) 

This  factor  will  relate  to  the  coefficient  of  friction  between  the  collision  spoke  and  the  ground. 

Rotational  Speed  Requirements  ~  The  final  system  equation  involves  our  desired  speed  relation  betw’een 
the  pendulum  and  the  wheel  after  collsion: 

72  =  (14) 

This  keeps  the  pendulum  at  absolute  angle  k  even  after  the  collision. 

SOLUTION  METHOD  -  Equations  8  through  14  may  be  combined  into  the  single  matrix  equation 
Au  =  Bj  in  the  following  form: 
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where,  for  compactness  of  notation,  we  have  labeled  the  following 


(15) 


=  -1  -^lyr  -  km  (l  +  2/:^  sin(ao  +  k)  -f  k]) 


A\2 

Aiz 


=  -kmkt{ki  +  8^(00  +  «)) 

(1  H-  km)  cos  Qo  +  kmkt  sin  k 


Ai4  = 


(1  +  km)  sin  Qo  H-  kmkt  cos  k 


In  solving  equation  15  the  first  thing  we  do,  as  discussed  before,  is  to  set  e  to  0,  which  represents  a  perfectly 
inelastic  collision.  Then,  since  it  is  undesirable  for  the  new  stance  spoke  to  slide  after  collision,  dX  is  set  to 
zero  and  equation  15  is  solved  for  the  unknown  force  coefficient  which  prevents  slipping,  fi: 

^  ~2  cos  Qo[(l  -h  km)  sin  gp  -  kmk  cos  k] 

^  [(1  4-  A:r72)(l  +  cos  2ao)  +  2r2^,.  -}-  2A:„iA:/  cosao  sin  k] 

Note  that  this  force  coefficient  has  the  same  relationship  as  one  would  expect  of  the  friction  relationship: 
F friction  =  static Fnormah  If  |/^|  <  f^static  for  the  surface  then  the  friction  force  is  sufficient  and  the  wheel 
does  not  slip.  After  checking  that  the  calculated  /x  is  satisfactory,  it  is  plugged  back  into  equation  15  to  solve 
for  the  remaining  unknowns,  including  7^: 


(l+fcm)  cos2ao+rg„r+fcmfc;  sin  (ap+K;) 
0-+km)+rjyr-f^mh  siXl  (oo-k) 


(17) 


Hence  the  speed  of  the  pendulum-driven  rimless  wheel  after  collision  is  simply  a  function  of  the  system 
parameters  {otoy'f'gyr.kmykt^n).  The  collision  efficiency,  r;,  indicates  how  much  wheel  speed  remains  after 
each  collision.  For  example,  if  the  wheel  had  a  complete  rim,  rather  than  a  finite  number  of  individual 
spokes,  ao  would  be  zero  and  t]  would  be  one. 


4  SIMULATION  RESULTS 


In  this  section,  we  define  a  default  wheel  with  pendulum  system.  We  describe  the  procedure  used  to  find  the 
steady-state  trajectory.  We  analyze  the  collision  equation  for  the  default  system  to  find  the  pendulum  angle 
which  yields  the  maximum  collision  efficiency.  We  present  the  results  of  a  battery  of  system  simulations 
which  show  the  steady-state  period  versus  pendulum  angle. 

One  problem  with  simulating  this  system  is  the  large  number  of  system  parameters  which  must  be 
selected.  There  are  five  separate  parameters  which  must  be  specified  before  the  pendulum-driven  rimless 
wheel  system  is  defined:  {ao,rgyr,  km,ke,K).  The  temporary  solution  arrived  at  is  to  fix  four  of  the  parameters 
and  call  it  the  default  system.  Table  1  shows  the  parameters  used  for  the  default  system.  While  the  results 
from  this  default  system  will  not  be  completely  general,  they  are  representative  of  a  probable  final  system 
design. 


Parameter 

Value 

ao 

0.314159 

0.3 

0.1 

ki 

0.5 

Table  1:  Parameter  values  for  default  rimless  wheel  system. 


Steady-state  speed  -  A  complete  system  simulation  cycle  starts  with  the  stance  spoke  at  7  =  alphoo  and 
the  wheel  at  some  initial  rotational  velocity.  The  equation  of  motion,  equation  6,  is  integrated  until  the 
stance  spoke  angle  reaches  Qo,  which  indicates  that  the  next  spoke  is  ready  to  collide  with  the  ground.  At 
this  point  the  after-collision  velocity,  7',  is  calculated  from  equation  17.  The  simulation  may  be  continued 
by  restarting  the  system  at  7  ~  alphcio  with  the  new  value  of  rotational  velocity.  The  steady-state  trajectory 
for  the  system  may  be  found  by  iterating  the  simulation  until  the  solution  stabilizes. 

Figure  4a  shows  the  steady-state  wheel  trajectory  of  the  default  system  with  /c  =  0  for  two  cycles  of 
the  motion.  Figure  4b  shows  the  corresponding  velocity  profile  of  the  motion.  At  collision  =  2.277) 

the  wheel  loses  15.3  %  of  its  speed. 


(a) 


'yij'new') 


(b) 


tnew 


Figure  4:  Steady-state  trajectory  of  default  system  with  k  =  0 


One  fruitful  question  we  might  ask  regards  the  maximum  collision  efficiency,  rj,  possible  for  the  default 
system.  The  higher  the  collision  efficiency,  after  all,  the  less  energy  is  lost  at  each  step.  Figure  5  shows  a 
plot  of  collision  efficiency  vs.  pendulum  angle  for  the  default  rimless  wheel  system.  Differentiating  equation 
17  with  respect  to  k  yields  a  maximum  collision  efficiency  of  0.848  at  k  =  0.248  rad. 


Figure  6  shows  simulation  data  for  the  steady-state  period  of  the  default  system  as  a  function  of  the 
pendulum  angle,  k.  Since  the  maximum  efficiency  was  at  /c  =  0.248  rad,  one  might  expect  that  the  minimum 


K 

Figure  6:  Plot  of  steady-state  cycle  period  vs.  pendulum  angle  for  default  system 

cycle  period  (and  hence  the  maximum  steady-state  wheel  speed)  in  figure  6  would  be  located  near  k  ~  0.248 
rad.  However,  figure  6  shows  that  the  maximum  wheel  speed  occurs  very  close  to  /€  —  0  rad,  specifically 
at  K  =  0.02  rad.  This  is  because  the  acceleration  of  the  wheel  during  the  dynamics  phase  of  the  motion 
is  maximized  by  k  =  0  rad.  It  is  apparent  that  the  pendulum’s  effect  on  the  dynamics  equation  is  more 
significant  than  its  effect  on  the  collision  efficiency  equation.  Figure  7  shows  the  pendulum-driven  rimless 
wheel  system  in  the  two  competing  configurations.  In  figure  7a,  the  pendulum  angle  maximizes  the  collision 
efficiency.  Figure  7b  shows  the  pendulum  angle  maximizing  the  steady-state  speed  of  the  wheel. 


5  CONCLUSION 

This  paper  described  the  pendulum-driven  rimless  wheel  system.  It  derived  the  dynamics  equation  and  the 
collision  equation  for  such  a  system  in  the  special  case  where  the  motor  maintains  the  pendulum  at  a  constant 
angle  relative  to  horizontal.  Simulation  results  were  presented  to  show  that  the  steady-state  trajectory  of 
such  a  system  may  be  found  by  iterative  simulation.  The  simulation  results  also  showed  that  the  maximum 
steady-state  speed  of  the  wheel  will  be  arrived  at  when  the  pendulum  is  held  out  nearly  horizontally. 


a)  Maximum  efficiency  b)  Maximum  speed 

Figure  7:  Pendulum  configurations  which  maximize  different  values  for  the  default  system 
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